Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing
Abstract
MicroRNAs (miRNAs) are small RNA molecules of 20-24 nt that have been well studied over the past decade because of their important regulatory roles in diverse cellular processes. Mature miRNA sequences are more conserved across wide phylogenetic distances than their precursors, and some are conserved across entire kingdoms; hence, their loci and functions can be predicted by homology searches. Several studies have used de novo prediction methods to identify miRNAs, but because of complex regulatory mechanisms or false-positive in silico predictions, not all predicted miRNAs are actually expressed, and computationally predicted mature transcripts sometimes differ from those actually expressed. With a complete genome sequence of Gossypium arboreum now available, it is important to annotate both the coding and non-coding regions of the genome using high-confidence transcript evidence for this cotton species, which is highly resistant to various biotic and abiotic stresses. Here we analyzed the small RNA transcriptome of G. arboreum leaves and provide genome annotation of miRNAs supported by evidence from miRNA/miRNA* transcripts. A total of 446 miRNAs clustered into 224 miRNA families were found, of which 48 families are conserved in other plants and 176 are novel. Four short RNA libraries were used to shortlist the best predictions based on high reads per million. The size, origin, copy number and transcript depth of all miRNAs, along with their isoforms and targets, are reported. The highest gene copy number was observed for gar-miR7504, followed by gar-miR166, gar-miR8771, gar-miR156 and gar-miR7484. Altogether, 1274 target genes were found in G. arboreum, and these are enriched for 216 KEGG pathways. The resulting genomic annotations are provided in UCSC BED format.
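Two ideas mentioned in the abstract, reads-per-million normalization and BED-format annotation, can be illustrated with a minimal Python sketch. The function names, chromosome label, coordinates and read counts below are hypothetical and are not taken from the study; the sketch only shows the general form of the calculation and of a six-column BED record.

```python
def reads_per_million(count, library_size):
    """Normalize a raw read count to reads per million (RPM)."""
    if library_size <= 0:
        raise ValueError("library_size must be positive")
    return count * 1_000_000 / library_size


def bed_line(chrom, start, end, name, score=0, strand="+"):
    """Format one six-column BED record (0-based start, exclusive end)."""
    if start < 0 or end <= start:
        raise ValueError("need 0 <= start < end")
    return "\t".join([chrom, str(start), str(end), name, str(score), strand])


# Hypothetical example: 50 reads of one miRNA in a library of 2,000,000 reads.
rpm = reads_per_million(50, 2_000_000)

# Hypothetical locus for a 21-nt mature miRNA on the plus strand.
record = bed_line("Chr01", 100, 121, "gar-miR166a", 0, "+")
```

Normalizing by library size makes read counts comparable across libraries of different sequencing depth, which is presumably why a reads-per-million threshold was used to shortlist predictions across the four libraries.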
Similar resources
Identification and Characterization of MicroRNAs in Asiatic Cotton (Gossypium arboreum L.)
To date, no miRNAs have been identified in the important diploid cotton species although there are several reports on miRNAs in upland cotton. In this study, we identified 73 miRNAs, belonging to 49 families, from Asiatic cotton using a well-developed comparative genome-based homologue search. Several of the predicted miRNAs were validated using quantitative real time PCR (qRT-PCR). The length ...
Introgression of cotton leaf curl virus-resistant genes from Asiatic cotton (Gossypium arboreum) into upland cotton (G. hirsutum).
Cotton is under constant threat from leaf curl virus, which is a major constraint on successful cotton production in Pakistan. A total of 3338 cotton genotypes belonging to different research stations were screened, but none were found to be resistant against the Burewala strain of cotton leaf curl virus (CLCuV). We explored the possibility of transferring virus-resistant genes f...
Genome-wide characterization and expression analysis of the aldehyde dehydrogenase (ALDH) gene superfamily under abiotic stresses in cotton.
In plants, aldehyde dehydrogenases (ALDHs) function as 'aldehyde scavengers' by removing reactive aldehydes and thus play important roles in stress responses. To date, 30 ALDHs have been identified in Gossypium raimondii, whereas ALDHs have not been studied in Gossypium arboreum or in tetraploid cotton. In this study, we identified 30, 59 and 59 aldehyde dehydrogenase (ALDH) genes from G. arbor...
GraP: platform for functional genomics analysis of Gossypium raimondii
Cotton (Gossypium spp.) is one of the most important natural fiber and oil crops worldwide. Improvement of fiber yield and quality under changing environments attracts much attention from cotton researchers; however, a functional analysis platform integrating omics data is still missing. The success of cotton genome sequencing and the large amount of available transcriptome data allow the opportuni...
Molecular Markers and Cotton Genetic Improvement: Current Status and Future Prospects
The narrow genetic base and complex allotetraploid genome of cotton (Gossypium hirsutum L.) are stimulating efforts to obtain the polymorphism required for marker-based breeding. The availability of draft genome sequences of G. raimondii and G. arboreum and of next generation sequencing (NGS) technologies has facilitated the development of high-throughput marker technologies in cotton. The concepts of genetic di...